Learning Visual Behavior for Gesture Analysis

نویسندگان

  • David Wilson
  • Aaron F. Bobick
  • Mubarak Shah
چکیده

Techniques for computing a representation of human gesture from a number of example image sequences are presented. We define gesture to be the class of human motions that are intended to communicate, and visual behavior as the sequence of visual events that make a complete gesture. Two main techniques are discussed: the first computes a representation that summarizes configuration space trajectories for use in gesture recognition. A prototype is derived from a set of training gestures; the prototype is then used to define the gesture as a sequence of states. The states capture both the repeatability and variability evidenced in a training set of example trajectories. The technique is illustrated with a wide range of gesture-related sensory data. The second technique incorporates multiple models into the Hidden Markov Model framework, so that models representing instantaneous visual input are trained concurrently with the temporal model. We exploit two constraints allowing application of the technique to view-based gesture recognition: gestures are modal in the space of possible human motion, and gestures are viewpoint-dependent. The recovery of the visual behavior of a number of simple gestures with a small number of low resolution example image sequences is shown. We consider a number of applications of the techniques and present work currently in progress to incorporate the training of multiple gestures concurrently for a higher level of gesture understanding. A number of directions of future work are presented, including more sophisticated methods of selecting and combining models appropriate for the gesture. Lastly, a comparison of the two techniques is presented. Thesis Supervisor: Aaron F. Bobick Title: Assistant Professor of Computational Vision Learning Visual Behavior for Gesture Analysis by Andrew David Wilson The following people served as readers for this thesis: Reader: Mubarak Shah Associate Professor, Director Computer Vision Laboratory Computer Science Department, University of Central Florida Reader: Whitman Richards Professor of Cognitive Science Head, Media Arts and Sciences Program

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Visual Behavior for Gesture Analysis 1.1 View-based Approach 2 Representation of Gesture 2.1 Multiple Models for Gesture

A state-based method for learning visual behavior from image sequences is presented. The technique is novel for its incorporation of multiple representations into the Hidden Markov Model framework. Independent representations of the instantaneous visual input at each state of the Markov model are estimated concurrently with the learning of the temporal characteristics. Measures of the degree to...

متن کامل

Neural Network Performance Analysis for Real Time Hand Gesture Tracking Based on Hu Moment and Hybrid Features

This paper presents a comparison study between the multilayer perceptron (MLP) and radial basis function (RBF) neural networks with supervised learning and back propagation algorithm to track hand gestures. Both networks have two output classes which are hand and face. Skin is detected by a regional based algorithm in the image, and then networks are applied on video sequences frame by frame in...

متن کامل

Gesture Unit Segmentation Using Spatial-Temporal Information and Machine Learning

Currently, automated gesture analysis is being widely used in different research areas, such as humancomputer interaction or human-behavior analysis. With regard to the latter area in particular, gesture analysis is closely related to studies on human communication. Linguists and psycholinguists analyze gestures from several standpoints, and one of them is the analysis of gesture segments. The ...

متن کامل

Learning English Auxiliary Modal Verbs by Iranian Children

Modal verbs in English are challenging to learn by speakers of other languages. The purpose of thiswas to shed light on the use of gesture in learning English modal verbs by Persian-speaking children.To achieve this, 60 elementary Iranian learners, studying at some institutes in Karaj took part in thisstudy. The participants were randomly put into one experimental group and one control group. T...

متن کامل

Research Summary

My research interest in computer vision includes human activity recognition, human motion analysis, tracking, human identification, gait and gesture recognition, statistical methods for computer vision. I mainly focus on developing robust real-time algorithms based on sound theory for solving realistic computer vision problems for many applications. My research is widely applicable in visual su...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995